1. Applicant herewith submits to the United States Designated Office (DO/US) the following
items under 35 U.S.C. 371:

a. [X] This express request to immediately begin national examination procedures (35
U.S.C. 371(f)).

b. [X] The U.S. National Fee (35 U.S.C. 371(c)(1)) and other fees (37 CFR 1.492) as
indicated below:

2. Fees

CLAIMS FEE

(1) FOR                (2) NUMBER FILED   (3) NUMBER EXTRA   (4) RATE      (5) CALCULATIONS
□* TOTAL CLAIMS        4 - 20 =           0                  x $22.00 =    $
   INDEPENDENT CLAIMS  3 - 3 =            0                  x $82.00 =
   MULTIPLE DEPENDENT CLAIM(S) (if applicable)               + $270.00

BASIC FEE**

The international search fee, as set forth in § 1.445(a)(2) to be
paid to the US PTO acting as an International Searching
Authority:

□ has been paid (37 CFR 1.492(a)(2)) ..... $790.00

□ has not been paid (37 CFR 1.492(a)(3)) ..... $1070.00

[X] where a search report on the international application
has been prepared by the European Patent Office or
the Japanese Patent Office (37 CFR 1.492(a)(5)) ..... $930.00        930.00

Total of above Calculations = 930.00

SMALL ENTITY

Reduction by 1/2 for filing by small entity, if applicable. Affidavit
must be filed also (note 37 CFR 1.9, 1.27, 1.28).

Subtotal: 930.00

Total National Fee: $930.00

Fee for recording the enclosed assignment document $40.00 (37
CFR 1.21(h)). (See Item 10 below.) See attached "ASSIGNMENT
COVER SHEET (37 CFR 3.34)".

TOTAL

Total Fees enclosed: $930.00
* See attached Preliminary Amendment Reducing the Number of Claims.
** WARNING: "To avoid abandonment of the application, the applicant shall furnish to the United States Patent
and Trademark Office not later than the expiration of 20 months from the priority date: * * *
(2) the basic national fee (see § 1.492(a)). The 20-month time limit may not be extended." 37
CFR § 1.494(b).
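
The arithmetic of the fee table above can be sketched in Python as follows. This is only an illustrative sketch: the function name `national_fee` and its parameters are assumptions made for this example, and the rates are simply the figures printed on this form ($22.00 per claim in excess of 20, $82.00 per independent claim in excess of 3, $270.00 for multiple dependent claims, and a one-half reduction for a small entity).

```python
from decimal import Decimal

# Rates as printed on this form; they are not current USPTO fees.
RATE_EXTRA_TOTAL = Decimal("22.00")        # per claim in excess of 20
RATE_EXTRA_INDEPENDENT = Decimal("82.00")  # per independent claim in excess of 3
FEE_MULTIPLE_DEPENDENT = Decimal("270.00")


def national_fee(total_claims, independent_claims, basic_fee,
                 multiple_dependent=False, small_entity=False):
    """Hypothetical helper reproducing the columns of the fee table."""
    extra_total = max(total_claims - 20, 0)
    extra_independent = max(independent_claims - 3, 0)
    fee = (Decimal(basic_fee)
           + extra_total * RATE_EXTRA_TOTAL
           + extra_independent * RATE_EXTRA_INDEPENDENT)
    if multiple_dependent:
        fee += FEE_MULTIPLE_DEPENDENT
    if small_entity:
        fee /= 2  # reduction by 1/2 applies to the total of the above
    return fee


# Values entered on this form: 4 total claims, 3 independent claims,
# basic fee of $930.00 (search report prepared by the EPO or JPO).
fee = national_fee(4, 3, "930.00")
print(fee)
```

With the values entered on the form, no extra-claim fees arise, so the result equals the basic fee.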
i. A check in the amount of $930.00 to cover the above fees is enclosed.

ii. □ Please charge Account No. ______ in the amount of
$______. A duplicate copy of this sheet is enclosed.

WARNING: If the translation of the international application, oath or declaration and national fee have not
been submitted by the applicant within twenty (20) months from the priority date, the applicant
will be so notified and given a period of time within which to file the translation and/or oath or
declaration in order to prevent abandonment. The payment of the surcharge set forth in § 1.492(e)
is required as a condition for accepting the oath or declaration later than twenty (20) months after
the priority date. The payment of the processing fee set forth in § 1.492(f) is required for
acceptance of an English translation later than twenty (20) months after the priority date. Failure
to comply with these requirements will result in abandonment of the application. The provisions
of § 1.136 will apply. 37 CFR § 1.494(c) and Notice of January 7, 1993, 1147 O.G. 29 to 40,
at 35.

3. A copy of the international application as filed [35 U.S.C. 371(c)(2)]: 

a. □ is transmitted herewith. 

b. □ is not required as the application was filed with the United States Receiv- 
ing Office. 

c. [X] has been transmitted

i. [X] by the International Bureau. Date of mailing of the application (from
form PCT/IB/308): 26 February 1998

ii. □ by applicant on (date) 

NOTE: "Section 1.494(b) was amended to require that the basic national fee and a copy of the international
application must be filed with the Office by 20 months from the priority date to avoid abandonment.
The International Bureau normally provides the copy of the international application to the Office in
accordance with PCT Article 20. At the same time, the International Bureau notifies the applicant of
the communication to the Office. In accordance with PCT Rule 47.1, that notice shall be accepted by
all designated offices as conclusive evidence that the communication has duly taken place. Thus, if the
applicant desires to enter the national stage and applicant has received notice from the International
Bureau, applicant need only pay the basic national fee by 20 months from the priority date." Notice
of January 7, 1993, 1147 O.G. 29 to 40, at 35.

4. [X] A translation of the international application into the English language [35 U.S.C.
371(c)(2)]:

a. [X] is transmitted herewith.

b. □ is not required as the application was filed in English. 

c. □ was previously transmitted by applicant on (date) 


5. [X] Amendments to the claims of the international application under PCT Article 19
[35 U.S.C. 371(c)(3)]:

NOTE: The Notice of January 7, 1993 indicates that 37 CFR § 1.494(d) was "amended to clarify the existing
practice that PCT Article 19 Amendments must be submitted by 20 months from the priority date, which
time may not be extended." This Notice further advises: "Of course, the failure to do so does not result
in loss of the subject matter of PCT Article 19 amendments. The applicant may submit that subject
matter in a preliminary amendment filed under Section 1.121. In many cases, filing an amendment under
Section 1.121 is preferable since grammatical or idiomatic errors may be corrected." 1147 O.G. 29-40
at 35. See item 11(c) below.

a. □ are transmitted herewith. 
b. □ have been transmitted

i. □ by the International Bureau. Date of mailing of the amendment (from
form PCT/IB/308):

ii. □ by applicant on (date)

c. [X] have not been transmitted as

i. □ no notification has been received that the International Searching
Authority has received the Search Copy.

ii. □ the Search Copy was received by the International Searching
Authority but the Search Report has not yet been issued. Date of receipt
of Search Copy (from form PCT/ISA/202):

iii. [X] applicant chose not to make amendments under PCT Article 19.
Date of mailing of Search Report (from form PCT/ISA/210):


iv. □ the time limit for the submission of amendments has not yet expired.
The amendments or a statement that amendments have not been made
will be transmitted before the expiration of the time limit under PCT Rule
46.1.


6. [X] A translation of the amendments to the claims under PCT Article 19 [35 U.S.C.
371(c)(3)]:

a. □ is transmitted herewith.

b. □ is not required as the amendments were made in the English language.

c. [X] has not been transmitted for reasons indicated at point 5.c. above.


7. __ An oath or declaration of the inventor [35 U.S.C. 371(c)(4)] complying with 35
U.S.C. 115

a. □ was previously submitted by applicant on (date)

b. □ is submitted herewith and such oath or declaration

i. □ is attached to the application.

ii. □ identifies the application and any amendments under PCT Article
19 which were transmitted as stated in points 3.b. or c. and 5.b; and
states that they were reviewed by the inventor as required by 37 CFR
1.70.

iii. __ will follow.


II. Other document(s) or information included:

8. [X] An International Search Report or Declaration under PCT Article 17(2)(a):

a. □ is transmitted herewith.

b. [X] has been transmitted by the International Bureau. Date of mailing (from
form PCT/IB/308): 26 February 1998

c. □ is not required as the application was searched by the United States
International Searching Authority.

d. □ will be transmitted promptly upon request.

e. □ has been submitted by applicant on (date) _ 

f. □ is not transmitted as the international search has not yet issued. 

9. [X] An Information Disclosure Statement under 37 CFR 1.97 and 1.98:

a. □ is transmitted herewith. 

Also transmitted herewith is 

□ Form PTO-1449

□ Copies of citations listed 

b. [X] will be transmitted within THREE MONTHS of the date of submission 
of requirements under 35 U.S.C. 371(c). 

c. □ was previously submitted by applicant on (date) 

10. □ An assignment document is transmitted herewith for recording. A separate

□ "COVER SHEET FOR ASSIGNMENT (DOCUMENT) ACCOMPANYING NEW
PATENT APPLICATION" or □ FORM PTO-1595 is also attached.

□ Please mail the recorded assignment document to:

i. □ the person whose signature and address appears below.
ii. □ the following:

11. Additional documents 

a. □ Copy of request (PCT/RO/101) 

b. [X] International Publication No. WO98/07840

i. □ Specification, claims and drawing

ii. [X] Front page only

c. □ Preliminary amendment (37 CFR § 1.121)

d. [X] Other: Certificate of Biological Deposit

Computer Readable Copy of Sequence Listing (Disk)

12. [X] The above checked items are being transmitted

a. □ before the 18th month publication.

b. [X] after publication and the Article 20 communication but before 20 months
from the priority date.

c. □ after 20 months (revival). 

NOTE: Petition to revive (37 CFR 1.137(a) or (b)) is necessary if 35 U.S.C. 371 requirements are submitted after
20 months.

13. □ Certain requirements under 35 U.S.C. 371 were previously submitted by the

applicant on (date), namely:
AUTHORIZATION TO CHARGE ADDITIONAL FEES


WARNING: Accurately count claims, especially multiple dependent claims, to avoid unexpected high charges
if extra claims are authorized.

[X] The Commissioner is hereby authorized to charge the following additional fees
which may be required by this paper and during the entire pendency of this
application to Account No. 20-0531:

[X] 37 CFR 1.492(a)(1), (2), (3), and (4) (filing fees)

WARNING: Because failure to pay the national fee within 20 months without extension (37 CFR § 1.494(b)(2))
results in abandonment of the application, it would be best to always check the above box.

[X] 37 CFR 1.492(b), (c), and (d) (presentation of extra claims)

NOTE: Because additional fees for excess or multiple dependent claims not paid on filing or on later presentation
must only be paid or these claims cancelled by amendment prior to the expiration of the time period
set for response by the PTO in any notice of fee deficiency (37 CFR 1.16(d)), it might be best not to
authorize the PTO to charge additional claim fees, except possibly when dealing with amendments after
final action.

□ 37 CFR 1.17 (application processing fees)

WARNING: While 37 CFR 1.17(a), (b), (c) and (d) deal with extensions of time under § 1.136(a), this authorization
should be made only with the knowledge that: "Submission of the appropriate extension fee under
37 CFR 1.136(a) is to no avail unless a request or petition for extension is filed" (Emphasis added).
Notice of November 5, 1985 (1060 O.G. 27).

□ 37 CFR 1.18 (issue fee at or before mailing of Notice of Allowance,
pursuant to 37 CFR 1.311(b)).

NOTE: Where an authorization to charge the issue fee to a deposit account has been filed before the mailing 
of a Notice of Allowance, the issue fee will be automatically charged to the deposit account at the time 
of mailing the notice of allowance. 37 CFR 1.311(b). 

NOTE: 37 CFR 1.28(b) requires "Notification of any change in loss of entitlement to small entity status must
be filed in the application . . . prior to paying or at the time of paying . . . issue fee". From the wording
of 37 CFR 1.28(b): (a) notification of change of status must be made even if the fee is paid as "other
than a small entity" and (b) no notification is required if the change is to another small entity.

□ 37 CFR 1.492(e) and (f) (surcharge fees for filing the declaration and/or
filing an English translation of an International Application later than 20
months after the priority date).



Reg. No. 32,503 


Tel. No. (617) 248-7263


[firm name illegible] LLP
High Street Tower / 125 High Street 

(P.O. Address) 

Boston, MA 02110 
DESCRIPTION


NOVEL DNA AND PROCESS FOR PREPARING PROTEIN USING THE DNA


FIELD OF TECHNOLOGY


The present invention relates to a novel DNA and a process
for preparing a protein which possesses an activity to inhibit
osteoclast differentiation and/or maturation (hereinafter
called osteoclastogenesis-inhibitory activity) by a genetic
engineering technique using the DNA. More particularly, the
present invention relates to a genomic DNA encoding a protein
OCIF which possesses an osteoclastogenesis-inhibitory activity
and a process for preparing said protein by a genetic engineering
technique using the genomic DNA.


BACKGROUND OF THE INVENTION


Human bones are constantly repeating a process of
resorption and formation. Osteoblasts, which control bone
formation, and osteoclasts, which control bone resorption, play
major roles in this process. Osteoporosis is a typical disease
caused by abnormal metabolism of bones. This disease is caused
when bone resorption by osteoclasts exceeds bone formation by
osteoblasts. Although the mechanism of this disease has yet
to be elucidated completely, the disease causes the bones to
ache, makes the bones fragile, and may result in fracturing
of the bones. As the population of the aged increases, this
disease results in an increase in bedridden aged people, which
has become a social problem. Urgent development of a therapeutic
agent for this disease is strongly desired. Disease due to a
decrease in bone mass is expected to be treated by controlling
bone resorption, accelerating bone formation, or improving the
balance between bone resorption and formation.

Osteogenesis is expected to increase by accelerating
proliferation, differentiation, or activation of the cells
controlling bone formation, or by controlling proliferation,
differentiation, or activation of the cells involved in bone
resorption. In recent years, strong interest has been directed
to physiologically active proteins (cytokines) exhibiting such
activities as described above, and energetic research is
ongoing on this subject. The cytokines which have been reported
to accelerate proliferation or differentiation of osteoblasts
include the proteins of the fibroblast growth factor family (FGF:
Rodan S. B. et al., Endocrinology vol. 121, p 1917, 1987),
insulin-like growth factor I (IGF-I: Hock J. M. et al.,
Endocrinology vol. 122, p 254, 1988), insulin-like growth factor II
(IGF-II: McCarthy T. et al., Endocrinology vol. 124, p 301, 1989),
Activin A (Centrella M. et al., Mol. Cell. Biol., vol. 11, p
250, 1991), transforming growth factor-β (Noda M., The Bone,
vol. 2, p 29, 1988), Vasculotropin (Varonique M. et al., Biochem.
Biophys. Res. Commun., vol. 199, p 380, 1994), and the proteins
of the heterotopic bone formation factor family (bone morphogenic
protein; BMP: BMP-2; Yanaguchi A. et al., J. Cell Biol. vol.
113, p 682, 1991, OP-1; Sampath T. K. et al., J. Biol. Chem.
vol. 267, p 20532, 1992, and Knutsen R. et al., Biochem. Biophys.
Res. Commun. vol. 194, p 1352, 1993).

On the other hand, transforming growth factor-β (Chenu C.
et al., Proc. Natl. Acad. Sci. USA, vol. 85, p 5683, 1988),
interleukin-4 (Kasano K. et al., Bone-Miner., vol. 21, p 179,
1993), and the like have been reported as cytokines which
suppress differentiation and/or maturation of osteoclasts.
Further, calcitonin (Bone-Miner., vol. 17, p 347, 1992),
macrophage colony stimulating factor (Hattersley G. et al.,
J. Cell. Physiol., vol. 137, p 199, 1988), interleukin-4
(Watanabe, K. et al., Biochem. Biophys. Res. Commun. vol. 172,
p 1035, 1990), and interferon-γ (Gowen M. et al., J. Bone Miner.
Res., vol. 1, p 469, 1986) have been reported as cytokines which
suppress bone resorption by osteoclasts.

These cytokines are expected to be used as agents for
treating diseases accompanying bone loss by accelerating bone
formation or suppressing bone resorption. Clinical tests
are being undertaken to verify the effect of some cytokines,
such as insulin-like growth factor-I and the heterotopic bone
formation factor family, in improving bone metabolism. In
addition, calcitonin is already commercially available as a
therapeutic agent for osteoporosis and a pain relief agent. At
present, drugs for clinically treating bone diseases or
shortening the period of treatment of bone diseases include
activated vitamin D3, calcitonin and its derivatives,
hormone preparations such as estradiol agents, ipriflavone, and
calcium preparations. These agents are not necessarily
satisfactory in terms of efficacy and therapeutic results.
Development of a novel therapeutic agent which can be used in
place of these agents is strongly desired.

In view of this situation, the present inventors have
undertaken extensive studies. As a result, the present
inventors found protein OCIF exhibiting an
osteoclastogenesis-inhibitory activity in a culture broth of
human embryonic lung fibroblast IMR-90 (ATCC Deposition No.
CCL186), and filed a patent application (PCT/JP96/00374). The
present inventors have conducted further studies relating to
the origin of this protein OCIF exhibiting the
osteoclastogenesis-inhibitory activity. The studies have
led to determination of the sequence of a genomic DNA
encoding OCIF of human origin. Accordingly, an object of the
present invention is to provide a genomic DNA encoding protein
OCIF exhibiting osteoclastogenesis-inhibitory activity and a
process for preparing this protein by a genetic engineering
technique using the genomic DNA.

DISCLOSURE OF THE INVENTION
Specifically, the present invention relates to a genomic
DNA encoding protein OCIF exhibiting osteoclastogenesis-inhibitory
activity and a process for preparing this protein
by a genetic engineering technique using the genomic DNA. The
DNA of the present invention includes the nucleotide sequences
No. 1 and No. 2 in the Sequence Table attached hereto.

Moreover, the present invention relates to a process for
preparing a protein, comprising inserting a DNA including the
nucleotide sequences of the sequences No. 1 and No. 2 in the
Sequence Table into an expression vector, producing a vector
capable of expressing a protein having the following
physicochemical characteristics and exhibiting the activity of
inhibiting differentiation and/or maturation of osteoclasts,
and producing this protein by a genetic engineering technique:

(a) molecular weight (SDS-PAGE):

(i) Under reducing conditions: about 60 kD,

(ii) Under non-reducing conditions: about 60 kD and about
120 kD;

(b) amino acid sequence:

includes an amino acid sequence of the Sequence ID No.
3 of the Sequence Table,

(c) affinity: 

exhibits affinity to a cation exchanger and heparin, and 

(d) thermal stability: 

(i) the osteoclast differentiation and/or maturation 
inhibitory activity is reduced when treated with heat 
at 70°C for 10 minutes or at 56°C for 30 minutes, 

(ii) the osteoclast differentiation and/or maturation 
inhibitory activity is lost when treated with heat 
at 90°C for 10 minutes. 

The protein obtained by expressing the gene of the present
invention exhibits an osteoclastogenesis-inhibitory activity.
This protein is effective as an agent for the treatment and
improvement of diseases involving a decrease in the amount of bone
such as osteoporosis, and of diseases relating to bone metabolism
abnormality such as rheumatism, degenerative joint disease, or
multiple myeloma, and is useful as an antigen to establish an
immunological diagnosis of such diseases.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a result of Western blotting analysis of
the protein obtained by expressing the genomic DNA of the present
invention in Example 4 (iii), wherein lane
1 indicates a marker, lane 2 indicates the culture broth of COS7
cells transfected with the vector pWESRαOCIF (Example 4 (iii)),
and lane 3 indicates the culture broth of COS7 cells
transfected with the vector pWESRα (control).

BEST MODE FOR CARRYING OUT THE INVENTION

The genomic DNA encoding the protein OCIF which exhibits
osteoclastogenesis-inhibitory activity in the present
invention can be obtained by preparing a cosmid library using
a human placenta genomic DNA and a cosmid vector and by screening
this library using DNA fragments which are prepared based on
the OCIF cDNA as a probe. The thus-obtained genomic DNA is
inserted into a suitable expression vector to prepare an OCIF
expression cosmid. A recombinant type OCIF can be obtained by
transfecting the genomic DNA into a host organism such as various
types of cells or microorganism strains and causing the DNA to
express a protein by a conventional method. The resultant
protein exhibiting osteoclastogenesis-inhibitory activity (an
osteoclastogenesis-inhibitory factor) is useful as an agent for
the treatment and improvement of diseases involving a decrease
in bone mass such as osteoporosis and other diseases relating
to bone metabolism abnormality and also as an antigen to prepare
antibodies for establishing immunological diagnosis of such
diseases. The protein of the present invention can be prepared
as a drug composition for oral or non-oral administration.
Specifically, the drug composition of the present invention
containing the protein which is an osteoclastogenesis-inhibitory
factor as an active ingredient can be safely
administered to humans and animals. Forms of the drug
composition include a composition for injection, composition for
intravenous drip, suppository, nasal agent, sublingual agent,
percutaneous absorption agent, and the like. In the
case of the composition for injection, such a composition is
a mixture of a pharmacologically effective amount of the
osteoclastogenesis-inhibitory factor of the present invention
and a pharmaceutically acceptable carrier. The composition may
further comprise amino acids, saccharides, cellulose
derivatives, and other excipients and/or activation agents,
including other organic compounds and inorganic compounds which
are commonly added to a composition for injection. When an
injection preparation is prepared using the
osteoclastogenesis-inhibitory factor of the present invention
and these excipients and activation agents, a pH adjuster,
buffering agent, stabilizer, solubilizing agent, and the like
may be added if necessary to prepare various types of injection
agents.

The present invention will now be described in more detail 
by way of examples which are given for the purpose of 
illustration and not intended to be limiting of the present 
invention. 
Example 1

< Preparation of a cosmid library >

A cosmid library was prepared using human placenta
genomic DNA (Clontech; Cat. No. 6550-2) and pWE15 cosmid vector
(Stratagene). The experiment was carried out principally
following the protocol attached to the pWE15 cosmid vector
kit (Stratagene), while Molecular Cloning: A
Laboratory Manual (Cold Spring Harbor Laboratory (1989)) was
referred to for common procedures for handling DNA, E. coli,
and phage.

(i) Preparation of a restriction enzyme digest of human genomic
DNA

Human placenta genomic DNA dissolved in 750 µl of a
solution containing 10 mM Tris-HCl, 10 mM MgCl2, and 100 mM NaCl
was added to four 1.5 ml Eppendorf tubes (tubes A, B, C, and D)
in the amount of 100 µg each. Restriction enzyme MboI was added
to these tubes in the amounts of 0.2 unit for tube A, 0.4 unit
for tube B, 0.6 unit for tube C, and 0.8 unit for tube D, and
the DNA was digested for 1 hour. Then, EDTA was added to each
tube to a concentration of 20 mM to terminate the
reaction, followed by extraction with phenol/chloroform (1:1).
A two-fold volume of ethanol was added to the aqueous layer to
precipitate DNA. The DNA was collected by centrifugation, washed
with 70% ethanol, and the DNA in each tube was dissolved in 100 µl
of TE (10 mM Tris-HCl (pH 8.0) + 1 mM EDTA buffer solution, hereinafter
called TE). The DNA in the four tubes was combined in one tube and
incubated for 10 minutes at 68°C. After cooling to room
temperature, the mixture was overlaid onto a 10%-40% linear
sucrose gradient which was prepared in a buffer containing 20
mM Tris-HCl (pH 8.0), 5 mM EDTA, and 1 mM NaCl in a centrifuge
tube (38 ml). The tube was centrifuged at 26,000 rpm for 24
hours at 20°C using a rotor SRP28SA manufactured by Hitachi,
Ltd. and 0.4 ml fractions of the sucrose gradient were collected
using a fraction collector. A portion of each fraction was
subjected to 0.4% agarose electrophoresis to confirm the size
of DNA. Fractions containing DNA with a length of 30 kb (kilo
base pair) to 40 kb were then combined. The DNA solution was
diluted with TE to bring the sucrose concentration to 10% or less
and 2.5-fold volumes of ethanol were added to precipitate DNA.
The DNA was dissolved in TE and stored at 4°C.

(ii) Preparation of cosmid vector 

The pWE15 cosmid vector obtained from Stratagene Company
was completely digested with restriction enzyme BamHI according
to the protocol attached to the cosmid vector kit. DNA
collected by ethanol precipitation was dissolved in TE to a
concentration of 1 mg/ml. The phosphate groups at the 5'-ends of this
DNA were removed using calf small intestine alkaline phosphatase,
and the DNA was collected by phenol extraction and ethanol
precipitation. The DNA was dissolved in TE to a concentration
of 1 mg/ml.

(iii) Ligation of genomic DNA to vector and in vitro packaging
Size-fractionated genomic DNA (1.5 µg) and pWE15 cosmid vector
digested with restriction enzyme BamHI (3 µg) were ligated in
20 µl of a reaction
solution using Ready-To-Go T4 DNA ligase of Pharmacia Company.
The ligated DNA was packaged in vitro using Gigapack™ II
packaging extract (Stratagene) according to the protocol.
After the packaging reaction, a portion of the reaction mixture
was diluted stepwise with an SM buffer solution and mixed with
E. coli XL1-Blue MR (Stratagene) which was suspended in 10 mM
MgCl2 to allow the phage to infect the cells, and plated onto LB agar plates
containing 50 µg/ml of ampicillin. The number of colonies
produced was counted. The number of colonies per 1 µl of
packaging reaction was calculated based on this result.
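
As a minimal sketch of the titer calculation described above, the number of colonies per µl of packaging reaction can be back-calculated from a plate count and its dilution. The function names and the example numbers (plate count, dilution, plated volume) are hypothetical and are not taken from the experiment; only the target of 50,000 colonies per 15 cm plate comes from this document.

```python
def colonies_per_ul(colonies_counted, dilution_factor, volume_plated_ul):
    """Colonies per µl of undiluted packaging reaction.

    dilution_factor is the total fold dilution (e.g. 100 for a 1:100 dilution).
    """
    if volume_plated_ul <= 0:
        raise ValueError("volume_plated_ul must be positive")
    return colonies_counted * dilution_factor / volume_plated_ul


def volume_for_target(target_colonies, titer_per_ul):
    """Volume (µl) of undiluted packaging reaction giving target_colonies."""
    return target_colonies / titer_per_ul


# Hypothetical numbers: 120 colonies from 10 µl of a 1:100 dilution.
titer = colonies_per_ul(120, 100, 10)
# Volume needed to obtain 50,000 colonies on one 15 cm plate.
volume = volume_for_target(50_000, titer)
print(titer, volume)
```

The same titer is what allows the plating density of the library plates to be set in advance.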
(iv) Preparation of a cosmid library

The packaging reaction solution thus prepared was mixed
with E. coli XL1-Blue MR and the mixture was plated onto agarose
plates containing ampicillin so as to produce 50,000 colonies
per agarose plate having a diameter of 15 cm. After incubating
the plates overnight at 37°C, an LB culture medium was added in
the amount of 3 ml per plate to suspend and collect colonies
of E. coli. Each agarose plate was again washed with 3 ml of
the LB culture medium and the washing was combined with the
original suspension of E. coli. The E. coli collected from all
agarose plates was placed in a centrifuge tube, glycerol was
added to a concentration of 20%, and ampicillin was further added
to make a final concentration of 50 µg/ml. A portion of the
E. coli suspension was removed and the remainder was stored at
-80°C. The removed E. coli was diluted stepwise and plated onto
agar plates to count the number of colonies per 1 ml of
suspension.
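
For orientation, the number of independent clones a library needs in order to contain a given sequence with a chosen probability is commonly estimated with the Clarke-Carbon formula, N = ln(1 - P) / ln(1 - f), where f is the insert size divided by the genome size. The sketch below is not part of the described protocol: the haploid human genome size of 3 x 10^9 bp and the 99% target probability are assumptions, and 35 kb is taken as the midpoint of the 30 kb to 40 kb fractions pooled in (i).

```python
import math


def clones_required(insert_bp, genome_bp, probability):
    """Clarke-Carbon estimate of the clones needed so that any given
    sequence is present with the requested probability."""
    if not 0 < probability < 1:
        raise ValueError("probability must be between 0 and 1")
    fraction = insert_bp / genome_bp
    return math.ceil(math.log(1 - probability) / math.log(1 - fraction))


GENOME_BP = 3.0e9   # assumed haploid human genome size
INSERT_BP = 35_000  # midpoint of the 30-40 kb size fraction

n_clones = clones_required(INSERT_BP, GENOME_BP, 0.99)
plates = math.ceil(n_clones / 50_000)  # 50,000 colonies per 15 cm plate
print(n_clones, plates)
```

Under these assumptions the estimate is on the order of a few hundred thousand clones, that is, several plates at the density used here.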
Example 2 

< Screening of cosmid library and purification of colony >

A nitrocellulose filter (Millipore) with a diameter of
14.2 cm was placed on each LB agarose plate with a diameter of
15 cm which contained 50 µg/ml of ampicillin. The cosmid library
was plated onto the plates so as to produce 50,000 colonies of
E. coli per plate, followed by incubation overnight at 37°C.
E. coli on the nitrocellulose filter was transferred to another
nitrocellulose filter according to a conventional method to
obtain two replica filters. According to the protocol attached
to the cosmid vector kit, cosmid DNA in the E. coli on the replica
filters was denatured with an alkali, neutralized, and
immobilized on the nitrocellulose filter using a Stratalinker
(Stratagene). The filters were heated for two hours at 80°C
in a vacuum oven. The nitrocellulose filters thus obtained were
hybridized using two kinds of DNA produced, respectively, from
the 5'-end and 3'-end of human OCIF cDNA as probes. Specifically, a
plasmid was purified from E. coli pKB/OIF10 (deposited at The
Ministry of International Trade and Industry, the Agency of
Industrial Science and Technology, Biotechnology Laboratory,
Deposition No. FERM BP-5267) containing OCIF cDNA. The plasmid
containing OCIF cDNA was digested with restriction enzymes KpnI
and EcoRI. Fragments thus obtained were separated using agarose
gel electrophoresis. A KpnI/EcoRI fragment with a length of 0.2
kb was purified using a QIAEX II gel extraction kit (Qiagen).
This DNA was labeled with 32P using the Megaprime DNA Labeling
System (Amersham) (5'-DNA probe). Apart from this, a
BamHI/EcoRV fragment with a length of 0.2 kb which was produced
from the above plasmid by digestion with restriction enzymes
BamHI and EcoRV was purified and labeled with 32P (3'-DNA probe).
One of the replica filters described above was hybridized with
the 5'-DNA probe and the other with the 3'-DNA probe.
Hybridization and washing of the filters were carried out
according to the protocol attached to the cosmid vector kit.
Autoradiography detected several positive signals with each
probe. One colony which gave positive signals with both probes
was identified. The colony on the agar plate which
corresponded to the signal on the autoradiogram was isolated
and purified. A cosmid was prepared from the purified colony
by a conventional method. This cosmid was named pWEOCIF. The
size of human genomic DNA contained in this cosmid was about
38 kb.
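
The selection logic of this screen, keeping only colonies that give a signal on both replica filters, amounts to intersecting the two sets of signal positions. A minimal sketch follows, in which the function name, the signal coordinates, and the matching tolerance are invented purely for illustration:

```python
def double_positives(signals_5p, signals_3p, tolerance_mm=1.0):
    """Return 5'-probe signals that have a 3'-probe signal at the same
    filter position (within tolerance_mm in both x and y)."""
    matches = []
    for x5, y5 in signals_5p:
        for x3, y3 in signals_3p:
            if abs(x5 - x3) <= tolerance_mm and abs(y5 - y3) <= tolerance_mm:
                matches.append((x5, y5))
                break
    return matches


# Hypothetical signal positions (mm) read from the two autoradiograms.
five_prime = [(12.0, 40.5), (63.2, 18.0), (90.1, 77.4)]
three_prime = [(30.7, 55.0), (63.5, 18.3), (101.0, 12.9)]

candidates = double_positives(five_prime, three_prime)
print(candidates)
```

Requiring a signal from both the 5' and the 3' probe favors clones that span the whole cDNA region rather than only one end of it.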
Example 3 

< Determination of the nucleotide sequence of human OCIF genomic
DNA >

(i) Subcloning of OCIF genomic DNA

Cosmid pWEOCIF was digested with restriction enzyme EcoRI.
After the separation of the DNA fragments thus produced by
electrophoresis using a 0.7% agarose gel, the DNA fragments were
transferred to a nylon membrane (Hybond-N, Amersham) by the
Southern blot technique and immobilized on the nylon membrane
using Stratalinker (Stratagene). On the other hand, plasmid
pBKOCIF was digested with restriction enzyme EcoRI and a 1.6
kb fragment containing human OCIF cDNA was isolated by agarose
gel electrophoresis. The fragment was labeled with 32P using
the Megaprime DNA labeling system (Amersham).

Hybridization of the nylon membrane described above with
the 32P-labeled 1.6-kb OCIF cDNA, performed according to a
conventional method, detected DNA fragments with sizes of
6 kb, 4 kb, 3.6 kb, and 2.6 kb. These fragments, which hybridized with
the human OCIF cDNA, were isolated using agarose gel
electrophoresis and individually subcloned into an EcoRI site
of pBluescript II SK+ vector (Stratagene) by a conventional
method. The resulting plasmids were respectively named pBSE
6, pBSE 4, pBSE 3.6, and pBSE 2.6.

(ii) Determination of the nucleotide sequence

The nucleotide sequence of the human OCIF genomic DNA 
subcloned into these plasmids was determined using the ABI 
Dideoxy Terminator Cycle Sequencing Ready Reaction kit (Perkin 
Elmer) and the 373 Sequencing System (Applied Biosystems). The 
primer used for the determination of the nucleotide sequence 
was synthesized based on the nucleotide sequence of human OCIF 
cDNA (Sequence ID No. 4 in the Sequence Table). The nucleotide 
sequences thus determined are given as Sequences No. 1 and 
No. 2 in the Sequence Table. Sequence ID No. 1 includes 
the first exon of the OCIF gene and Sequence ID No. 2 includes 
the second, third, fourth, and fifth exons. A stretch of about 
17 kb is present between the first and second exons. 
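
The exon/intron layout described above can be sanity-checked against the sequence listing. The following is a minimal sketch, not part of the original disclosure: it takes the first and last bases of each intron as they appear in Sequences No. 1 and No. 2 and tests them against the canonical GT...AG splice-site pattern. The GT...AG rule itself is general background knowledge rather than something stated in this specification, and the dictionary labels and function names are hypothetical, chosen only for illustration.

```python
# First and last bases of each intron, copied from Sequences No. 1 and No. 2.
# The labels "intron 1" .. "intron 4" are names chosen for this sketch only.
INTRON_ENDS = {
    "intron 1": ("GTAAGTCCCT", "TCCTTCTAG"),   # between exons 1 and 2
    "intron 2": ("GTACGTGTCA", "TTTTCCGTAG"),  # between exons 2 and 3
    "intron 3": ("GTAATTACAT", "CTTTCTTTAG"),  # between exons 3 and 4
    "intron 4": ("GTATGATAAT", "TTTATGCAG"),   # between exons 4 and 5
}


def follows_gt_ag(five_prime: str, three_prime: str) -> bool:
    """Return True if an intron starts with GT and ends with AG."""
    return five_prime.upper().startswith("GT") and three_prime.upper().endswith("AG")


def check_all(introns=INTRON_ENDS):
    """Map each intron label to whether it matches the GT...AG pattern."""
    return {name: follows_gt_ag(start, end) for name, (start, end) in introns.items()}


if __name__ == "__main__":
    for name, ok in check_all().items():
        print(name, "GT...AG" if ok else "non-canonical")
```

Only the boundary bases are used, so the check does not depend on the roughly 17 kb of the first intron that is absent from the sequence listing.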
Example 4 

<Production of recombinant OCIF using COS-7 cells> 

(i) Preparation of OCIF genomic DNA expression cosmid 

To express OCIF genomic DNA in animal cells, an expression 
unit of expression plasmid pcDL-SRα296 (Molecular and Cellular 
Biology, vol. 8, p 466-472, 1988) was inserted into cosmid vector 
pWE15 (Stratagene). First, the expression plasmid 
pcDL-SRα296 was digested with restriction enzyme SalI to 
cut out an expression unit about 1.7 kb long, which 
includes an SRα promoter, an SV40 late splice signal, a poly(A) 
addition signal, and so on. The digestion products were 
separated by agarose gel electrophoresis and the 1.7-kb fragment 
was purified using the QIAEX II gel extraction kit (Qiagen). 
Separately, cosmid vector pWE15 was digested with 
restriction enzyme EcoRI and the fragments were separated by 
agarose gel electrophoresis. The 8.2-kb pWE15 DNA was 
purified using the QIAEX II gel extraction kit (Qiagen). The 
ends of these two DNA fragments were blunted using a DNA 
blunting kit (Takara Shuzo), ligated using a DNA ligation kit 
(Takara Shuzo), and introduced into E. coli DH5α (Gibco BRL). 
The resultant transformant was grown and the expression cosmid 
pWESRα containing the expression unit was purified using a 
Qiagen column (Qiagen). 

The cosmid pWEOCIF containing the OCIF genomic DNA with 
a length of about 38 kb obtained in (i) above was digested with 
restriction enzyme NotI to cut out the OCIF genomic DNA of 
about 38 kb. After separation by agarose gel electrophoresis, 
the DNA was purified using the QIAEX II gel extraction kit 
(Qiagen). Separately, the expression cosmid pWESRα was 
digested with restriction enzyme EcoRI, and the digestion 
product was extracted with phenol and chloroform, 
ethanol-precipitated, and dissolved in TE. 

The EcoRI-digested pWESRα and an 
EcoRI-XmnI-NotI adapter (#1105, #1156, New England 
Biolaboratory Co.) were ligated using T4 DNA ligase (Takara 
Shuzo Co., Ltd.). After removal of the free adapter by agarose 
gel electrophoresis, the product was purified using the QIAEX gel 
extraction kit (Qiagen). The OCIF genomic DNA with a length 
of about 37 kb, which was derived from the digestion with 
restriction enzyme NotI, and the pWESRα to which the adapter was 
attached were ligated using T4 DNA ligase (Takara Shuzo). The 
DNA was packaged in vitro using the Gigapack packaging extract 
(Stratagene) and used to infect E. coli XL1-Blue MR (Stratagene). 
The resultant transformant was grown and the expression cosmid 
pWESRαOCIF, into which the OCIF genomic DNA was inserted, was 
purified using a Qiagen column (Qiagen). The OCIF expression 
cosmid pWESRαOCIF was ethanol-precipitated, dissolved in 
sterile distilled water, and used in the following analysis. 

(ii) Transient expression of OCIF genomic DNA and measurement of OCIF activity 

A recombinant OCIF was expressed as described below 
using the OCIF expression cosmid pWESRαOCIF obtained in (i) 
above, and its activity was measured. COS-7 cells (8×10⁵ cells/well; 
Riken Cell Bank, RCB0539) were seeded in a 6-well plate 
in DMEM culture medium (Gibco BRL) containing 10% fetal 
bovine serum (Gibco BRL). On the following day, the culture 
medium was removed and the cells were washed with serum-free DMEM 
culture medium. The OCIF expression cosmid pWESRαOCIF, which 
had been diluted with OPTI-MEM culture medium (Gibco BRL), was 
mixed with Lipofectamine and the mixture was added to the cells 
in each well according to the attached protocol. The expression 
cosmid pWESRα was added to the cells in the same manner as a 
control. The amounts of cosmid DNA and Lipofectamine were 
3 µg and 12 µl, respectively. After 24 hours, the culture medium 
was removed and 1.5 ml of fresh EX-CELL 301 culture medium (JRH 
Bioscience) was added to each well. The culture medium was 
recovered after 48 hours and used as a sample for the measurement 
of OCIF activity. The measurement of OCIF activity was carried 
out according to the method described by Kumegawa, M. et al. 
(Protein, Nucleic Acid, and Enzyme, Vol. 34, p 999 (1989)) and 
the method of Takahashi, N. et al. (Endocrinology vol. 122, p 
1373 (1988)). Osteoclast formation from bone marrow cells 
isolated from mice aged about 17 days, in the presence of 
activated vitamin D₃, was evaluated by the induction of tartaric 
acid-resistant acid phosphatase activity. The inhibition of 
this acid phosphatase activity was measured and used as the 
activity of the protein that possesses osteoclastogenesis-inhibitory 
activity (OCIF). Specifically, 100 µl/well of an OCIF sample that 
was diluted with α-MEM culture medium (Gibco BRL) containing 
2×10⁻⁸ M activated vitamin D₃ and 10% fetal bovine serum was added 
to each well of a 96-well microplate. Then, 3×10⁵ bone marrow 
cells isolated from mice (about 17 days old), suspended in 100 
µl of α-MEM culture medium containing 10% fetal bovine serum, 
were added to each well of the 96-well microplate and cultured 
for a week at 37°C and 100% humidity under a 5% CO₂ atmosphere. 
On days 3 and 5, 160 µl of the conditioned medium was removed 
from each well, and 160 µl of a sample diluted with 
α-MEM culture medium containing 1×10⁻⁸ M activated vitamin D₃ 
and 10% fetal bovine serum was added. Seven days after the start 
of culturing, the cells were washed with phosphate-buffered 
saline and fixed with an ethanol/acetone (1:1) solution for one 
minute at room temperature. Osteoclast formation was 
detected by staining the cells using an acid phosphatase 
activity measurement kit (Acid Phosphatase, Leucocyte, Cat. No. 
387-A, Sigma). A decrease in the number of cells 
positive for acid phosphatase activity in the presence of 
tartaric acid was taken as the OCIF activity. The results are 
shown in Table 1, which indicates that the conditioned medium 
exhibits activity similar to that of natural OCIF obtained from 
the IMR-90 culture medium and of recombinant OCIF produced by CHO 
cells. 
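
The vitamin D₃ concentrations in the assay above follow from simple mixing arithmetic: the sample is prepared at 2×10⁻⁸ M because it is diluted 1:1 by the cell suspension, and the later medium exchanges use 1×10⁻⁸ M so that the working concentration stays constant. The sketch below illustrates this under two assumptions that are not stated in the text: the cell suspension contributes no vitamin D₃, and none is consumed or degraded during culture. The function names are hypothetical.

```python
import math


def mix(volume_a_ul, conc_a_molar, volume_b_ul, conc_b_molar):
    """Concentration after combining two solutions (volumes in µl, conc. in M)."""
    total_volume = volume_a_ul + volume_b_ul
    return (volume_a_ul * conc_a_molar + volume_b_ul * conc_b_molar) / total_volume


def vitamin_d3_schedule():
    """Return (concentration at day 0, concentration after a medium exchange)."""
    # Day 0: 100 µl sample at 2e-8 M plus 100 µl cell suspension (assumed 0 M).
    day0 = mix(100, 2e-8, 100, 0.0)
    # Days 3 and 5: 160 µl of the 200 µl is removed, leaving 40 µl,
    # and 160 µl of fresh sample at 1e-8 M is added.
    after_exchange = mix(40, day0, 160, 1e-8)
    return day0, after_exchange


if __name__ == "__main__":
    day0, later = vitamin_d3_schedule()
    print(f"day 0: {day0:.1e} M, after exchange: {later:.1e} M")
    print("constant:", math.isclose(day0, later))
```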
TABLE 1 

Activity of OCIF expressed by COS-7 cells in the conditioned medium 

Dilution                       1/10   1/20   1/40   1/80   1/160   1/320 
OCIF genomic DNA introduced    ++     ++     ++     ++     + 
Vector introduced 
Untreated 

"++" indicates an activity inhibiting 80% or more of 
osteoclast formation, "+" indicates an activity inhibiting 
30-80% of osteoclast formation, and "-" indicates that no 
inhibition of osteoclast formation is observed. 
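
The scoring convention in the legend of Table 1 can be expressed as a small helper. This is an illustrative sketch only: the function names are hypothetical, the two-fold dilution series is generated rather than read from the table, and treating every value below 30% as "-" is an assumption, since the legend defines "-" only as no observed inhibition and does not say how values between 0% and 30% were scored.

```python
def dilution_series(first=10, steps=6, factor=2):
    """Denominators of a serial dilution, e.g. 1/10, 1/20, ... for factor 2."""
    return [first * factor ** i for i in range(steps)]


def score_inhibition(percent_inhibition):
    """Map percent inhibition of osteoclast formation to a Table 1 symbol."""
    if percent_inhibition >= 80:
        return "++"   # 80% or more
    if percent_inhibition >= 30:
        return "+"    # 30-80%
    return "-"        # assumption: anything below 30% is scored as "-"


if __name__ == "__main__":
    # Hypothetical inhibition values, used only to show the output format.
    example_values = [95, 90, 88, 82, 45, 10]
    for denom, value in zip(dilution_series(), example_values):
        print(f"1/{denom}: {score_inhibition(value)}")
```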


(iii) Identification of the product by Western blotting 

A buffer solution (10 µl) for SDS-PAGE (0.5 M Tris-HCl, 
20% glycerol, 4% SDS, 20 µg/ml bromophenol blue, pH 6.8) was 
added to 10 µl of the sample for the measurement of OCIF activity 
prepared in (ii) above. After boiling for 3 minutes at 100°C, 
the mixture was subjected to 10% SDS polyacrylamide gel 
electrophoresis under non-reducing conditions. The proteins 
were transferred from the gel to a PVDF membrane (ProBlott, 
Perkin Elmer) using a semi-dry blotting apparatus (Bio-Rad). The 
membrane was blocked and incubated for 2 hours at 37°C together 
with a horseradish peroxidase-labeled anti-OCIF antibody 
obtained by labeling the previously obtained OCIF protein with 
horseradish peroxidase according to a conventional method. 
After washing, the protein that had bound the anti-OCIF 
antibody was detected using the ECL system (Amersham). As shown 
in Figure 1, two bands, one with a molecular weight of about 
120 kilodaltons and the other of about 60 kilodaltons, were detected in 
the supernatant obtained from the culture broth of COS-7 cells 
transfected with pWESRαOCIF. On the other hand, these 
two bands with molecular weights of about 120 kilodaltons and 
60 kilodaltons were not detected in the supernatant obtained 
from the culture broth of COS-7 cells transfected with the pWESRα 
vector, confirming that the protein obtained was OCIF. 

INDUSTRIAL APPLICABILITY 
The present invention provides a genomic DNA encoding 
a protein OCIF which possesses an osteoclastogenesis- 
inhibitory activity and a process for preparing this protein 
by a genetic engineering technique using the genomic DNA. The 
protein obtained by expressing the gene of the present invention 
exhibits an osteoclastogenesis-inhibitory activity and is 
useful as an agent for the treatment and improvement of diseases 
involving a decrease in the amount of bone such as osteoporosis, 
other diseases resulting from bone metabolism abnormality such 
as rheumatism or degenerative joint disease, and multiple 
myeloma. The protein is further useful as an antigen to 
establish antibodies useful for an immunological diagnosis of 
such diseases. 

NOTE ON MICROORGANISM 
Depositing Organization: 

The Ministry of International Trade and Industry, National 
Institute of Bioscience and Human Technology, Agency of 
Industrial Science and Technology 

Address: 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, 
Japan 
Date of Deposition: June 21, 1995 (originally deposited on 
June 21, 1995 and transferred to the international 
deposition according to the Budapest Treaty on 
October 25, 1995) 

Accession No. FERM BP-5267 

TABLE OF SEQUENCES 


Sequence number: 1 
Length of sequence: 1316 
Sequence Type: nucleic acid 
Strandedness: double 
Topology: linear 

Molecular type: genomic DNA (human OCIF genomic DNA-1) 


Sequence : 


CTGGAGACAT ATAACTTGAA CACTTGGCCC TGATGGGGAA GCAGCTCTGC AGGGACTTTT 60 
TCAGCCATCT GTAAACAATT TCAGTGGCAA CCCGCGAACT GTAATCCATG AATGGGACCA 120 
CACTTTACAA GTCATCAAGT CTAACTTCTA GACCAGGGAA TTAATGGGGG AGACAGCGAA 180 
CCCTAGAGCA AAGTGCCAAA CTTCTGTCGA TAGCTTGAGG CTAGTGGAAA GACCTCGAGG 240 
AGGCTACTCC AGAAGTTCAG CGCGTAGGAA GCTCCGATAC CAATAGCCCT TTGATGATGG 300 
TGCGGTTGGT GAAGGGAACA GTGCTCCGCA AGGTTATCCC TGCCCCAGGC AGTCCAATTT 360 
TCACTCTGCA GATTCTCTCT GGCTCTAACT ACCCCAGATA ACAAGGAGTG AATGCAGAAT 420 
AGCACGGGCT TTAGGGCCAA TCAGACATTA GTTAGAAAAA TTCCTACTAC ATGGTTTATG 480 
TAAACTTGAA GATGAATGAT TGCGAACTCC CCGAAAAGGG CTCAGACAAT GCCATGCATA 540 
AAGAGGGGCC CTGTAATTTG AGGTTTCAGA ACCCGAAGTG AAGGGGTCAG GCAGCCGGGT 600 
ACGGCGGAAA CTCACAGCTT TCGCCCAGCG AGAGGACAAA GGTCTGGGAC ACACTCCAAC 660 
TGCGTCCGGA TCTTGGCTGG ATCGGACTCT CAGGGTGGAG GAGACACAAG CACAGCAGCT 720 
GCCCAGCGTG TGCCCAGCCC TCCCACCGCT GGTCCCGGCT GCCAGGAGGC TGGCCGCTGG 780 
CGGGAAGGGG CCGGGAAACC TCAGAGCCCC GCGGAGACAG CAGCCGCCTT GTTCCTCAGC 840 
CCGGTGGCTT TTTTTTCCCC TGCTCTCCCA GGGGACAGAC ACCACCGCCC CACCCCTCAC 900 
GCCCCACCTC CCTGGGGGAT CCTTTCCCCC CCAGCCCTGA AAGCGTTAAT CCTGGAGCTT 960 
TCTGCACACC CCCCGACCGC TCCCGCCCAA GCTTCCTAAA AAAGAAAGGT GCAAAGTTTG 1020 
GTCCAGGATA GAAAAATGAC TGATCAAAGG CAGGCGATAC TTCCTGTTGC CGGGACGCTA 1080 
TATATAACGT GATGAGCGCA CGGGCTGCGG AGACGCACCG GAGCGCTCGC CCAGCCGCCG 1140 
CCTCCAAGCC CCTGAGGTTT CCGGGGACCA CA ATG AAC AAG TTG CTG TGC TGC 1193 
                                    Met Asn Lys Leu Leu Cys Cys 
                                        -20                 -15 

GCG CTC GTG GTAAGTCCCT GGGCCAGCCG ACGGGTGCCC GGCGCCTGGG 1242 
Ala Leu Val 

GAGGCTGCTG CCACCTGGTC TCCCAACCTC CCAGCGGACC GGCGGGGAGA AGGCTCCACT 1302 
CGCTCCCTCC CAGG 1316 

Sequence number: 2 
Length of sequence: 9898 
Sequence Type : nucleic acid 
Strandedness : double 
Topology: linear 

Molecular type: genomic DNA (human OCIF genomic DNA-2) 
Sequence : 

GCTTACTTTG TGCCAAATCT CATTAGGCTT AAGGTAATAC AGGACTTTGA GTCAAATGAT 60 
ACTGTTGCAC ATAAGAACAA ACCTATTTTC ATGCTAAGAT GATGCCACTG TGTTCCTTTC 120 
TCCTTCTAG TTT CTG GAC ATC TCC ATT AAG TGG ACC ACC CAG GAA ACG TTT 171 
Phe Leu Asp Ile Ser Ile Lys Trp Thr Thr Gln Glu Thr Phe 
-10 -5 1 

CCT CCA AAG TAC CTT CAT TAT GAC GAA GAA ACC TCT CAT CAG CTG TTG 219 
Pro Pro Lys Tyr Leu His Tyr Asp Glu Glu Thr Ser His Gln Leu Leu 
5 10 15 

TGT GAC AAA TGT CCT CCT GGT ACC TAC CTA AAA CAA CAC TGT ACA GCA 267 
Cys Asp Lys Cys Pro Pro Gly Thr Tyr Leu Lys Gln His Cys Thr Ala 
20 25 30 35 

AAG TGG AAG ACC GTG TGC GCC CCT TGC CCT GAC CAC TAC TAC ACA GAC 315 
Lys Trp Lys Thr Val Cys Ala Pro Cys Pro Asp His Tyr Tyr Thr Asp 
40 45 50 

AGC TGG CAC ACC AGT GAC GAG TGT CTA TAC TGC AGC CCC GTG TGC AAG 363 
Ser Trp His Thr Ser Asp Glu Cys Leu Tyr Cys Ser Pro Val Cys Lys 
55 60 65 

GAG CTG CAG TAC GTC AAG CAG GAG TGC AAT CGC ACC CAC AAC CGC GTG 411 
Glu Leu Gln Tyr Val Lys Gln Glu Cys Asn Arg Thr His Asn Arg Val 
70 75 80 

TGC GAA TGC AAG GAA GGG CGC TAC CTT GAG ATA GAG TTC TGC TTG AAA 459 
Cys Glu Cys Lys Glu Gly Arg Tyr Leu Glu Ile Glu Phe Cys Leu Lys 
85 90 95 

CAT AGG AGC TGC CCT CCT GGA TTT GGA GTG GTG CAA GCT G GTACGTGTCA 509 
His Arg Ser Cys Pro Pro Gly Phe Gly Val Val Gln Ala 
100 105 110 

ATGTGCAGCA AAATTAATTA GGATCATGCA AAGTCAGATA GTTGTGACAG TTTAGGAGAA 569 
[positions 570-629 illegible in source] 629 
[positions 630-679 illegible in source] ACTTTGCCAC 689 

TACAGGGCAA TTTAATGACA AATCTCAAAT GCAGCAAATT ATTCTCTCAT GAGATGCATG 749 
ATGGTTTTTT TTTTTTTTTT TAAAGAAACA AACTCAAGTT GCACTATTGA TAGTTGATCT 809 
ATACCTCTAT ATTTCACTTC AGCATGGACA CCTTCAAACT GCAGCACTTT TTGACAAACA 869 
TCAGAAATGT TAATTTATAC CAAGAGAGTA ATTATGCTCA TATTAATGAG ACTCTGGAGT 929 
GCTAACAATA AGCAGTTATA ATTAATTATG TAAAAAATGA GAATGGTGAG GGGAATTGCA 989 
TTTCATTATT AAAAACAAGG CTAGTTCTTC CTTTAGCATG GGAGCTGAGT GTTTGGGAGG 1049 
GTAAGGACTA TAGCAGAATC TCTTCAATGA GCTTATTCTT TATCTTAGAC AAAACAGATT 1109 
GTCAAGCCAA GAGCAAGCAC TTGCCTATAA ACCAAGTGCT TTCTCTTTTG CATTTTGAAC 1169 
AGCATTGGTC AGGGCTCATG TGTATTGAAT CTTTTAAACC AGTAACCCAC GTTTTTTTTC 1229 
TGCCACATTT GCGAAGCTTC AGTGCAGCCT ATAACTTTTC ATAGCTTGAG AAAATTAAGA 1289 
GTATCCACTT ACTTAGATGG AAGAAGTAAT CAGTATAGAT TCTGATGACT CAGTTTGAAG 1349 
CAGTGTTTCT CAACTGAAGC CCTGCTGATA TTTTAAGAAA TATCTGGATT CCTAGGCTGG 1409 
ACTCCTTTTT GTGGGCAGCT GTCCTGCGCA TTGTAGAATT TTGGCAGCAC CCCTGGACTC 1469 
TAGCCACTAG ATACCAATAG CAGTCCTTCC CCCATGTGAC AGCCAAAAAT GTCTTCAGAC 1529 
ACTGTCAAAT GTCGCCAGGT GGCAAAATCA CTCCTGGTTG AGAACAGGGT CATCAATGCT 1589 
AAGTATCTGT AACTATTTTA ACTCTCAAAA CTTGTGATAT ACAAAGTCTA AATTATTAGA 1649 
CGACCAATAC TTTAGGTTTA AAGGCATACA AATGAAACAT TCAAAAATCA AAATCTATTC 1709 
TGTTTCTCAA ATAGTGAATC TTATAAAATT AATCACAGAA GATGCAAATT GCATCAGAGT 1769 
CCCTTAAAAT TCCTCTTCGT ATGAGTATTT GAGGGAGGAA TTGGTGATAG TTCCTACTTT 1829 
CTATTGGATG GTACTTTGAG ACTCAAAAGC TAAGCTAAGT TGTGTGTGTG TCAGGGTGCG 1889 
GGGTGTGGAA TCCCATCAGA TAAAAGCAAA TCCATGTAAT TCATTCAGTA AGTTGTATAT 1949 
GTAGAAAAAT GAAAAGTGGG CTATGCAGCT TGGAAACTAG AGAATTTTGA AAAATAATGG 2009 
AAATCACAAG GATCTTTCTT AAATAAGTAA GAAAATCTGT TTGTAGAATG AAGCAAGCAG 2069 
GCAGCCAGAA GACTCAGAAC AAAAGTACAC ATTTTACTCT GTGTACACTG GCAGCACAGT 2129 
GGCATTTATT TACCTCTCCC TCCCTAAAAA CCCACACAGC GCTTCCTCTT GGGAAATAAG 2189 
AGGTTTCCAG CCCAAAGAGA AGGAAAGACT ATGTGGTGTT ACTCTAAAAA GTATTTAATA 2249 
ACCGTTTTGT TGTTGCTGTT GCTGTTTTGA AATCAGATTG TCTCCTCTCC ATATTTTATT 2309 
TACTTCATTC TGTTAATTCC TGTGGAATTA CTTAGAGCAA GCATGGTGAA TTCTCAACTG 2369 
TAAAGCCAAA TTTCTCCATC ATTATAATTT CACATTTTGC CTGGCAGGTT ATAATTTTTA 2429 
TATTTCCACT GATAGTAATA AGGTAAAATC ATTACTTAGA TGGATAGATC TTTTTCATAA 2489 
AAAGTACCAT CAGTTATAGA GGGAAGTCAT GTTCATGTTC AGGAAGGTCA TTAGATAAAG 2549 
CTTCTGAATA TATTATGAAA CATTAGTTCT GTCATTCTTA GATTCTTTTT GTTAAATAAC 2609 
TTTAAAAGCT AACTTACCTA AAAGAAATAT CTGACACATA TGAACTTCTC ATTAGGATGC 2669 
AGGAGAAGAC CCAAGCCACA GATATGTATC TGAAGAATGA ACAAGATTCT TAGGCCCGGC 2729 
ACGGTGGCTC ACATCTGTAA TCTCAAGAGT TTGAGAGGTC AAGGCGGGCA GATCACCTGA 2789 
GGTCAGGAGT TCAAGACCAG CCTGGCCAAC ATGATGAAAC CCTGCCTCTA CTAAAAATAC 2849 
AAAAATTAGC AGGGCATGGT GGTGCATGCC TGCAACCCTA GCTACTCAGG AGGCTGAGAC 2909 
AGGAGAATCT CTTGAACCCT CGAGGCGGAG GTTGTGGTGA GCTGAGATCC CTCTACTGCA 2969 
CTCCAGCCTG GGTGACAGAG ATGAGACTCC GTCCCTGCCG CCGCCCCCGC CTTCCCCCCC 3029 
AAAAAGATTC TTCTTCATGC AGAACATACG GCAGTCAACA AAGGGAGACC TGGGTCCAGG 3089 
TGTCCAAGTC ACTTATTTCG AGTAAATTAG CAATGAAAGA ATGCCATGGA ATCCCTGCCC 3149 
AAATACCTCT GCTTATGATA TTGTAGAATT TGATATAGAG TTGTATCCCA TTTAAGGAGT 3209 
AGGATGTAGT AGGAAAGTAC TAAAAACAAA CACACAAACA GAAAACCCTC TTTGCTTTGT 3269 
AAGGTGGTTC CTAAGATAAT GTCAGTGCAA TGCTGGAAAT AATATTTAAT ATGTGAAGGT 3329 
TTTAGGCTGT GTTTTCCCCT CCTGTTCTTT TTTTCTGCCA GCCCTTTGTC ATTTTTGCAG 3389 
GTCAATGAAT CATGTAGAAA GAGACAGGAG ATGAAACTAG AACCAGTCCA TTTTGCCCCT 3449 
TTTTTTATTT TCTGGTTTTG GTAAAAGATA CAATGAGGTA GGAGGTTGAG ATTTATAAAT 3509 
GAAGTTTAAT AAGTTTCTGT AGCTTTGATT TTTCTCTTTC ATATTTGTTA TCTTGCATAA 3569 
GCCAGAATTG GCCTGTAAAA TCTACATATG GATATTGAAG TCTAAATCTG TTCAACTAGC 3629 
TTACACTAGA TGGAGATATT TTCATATTCA GATACACTGG AATGTATGAT CTAGCCATGC 3689 
GTAATATAGT CAAGTGTTTG AAGGTATTTA TTTTTAATAG CGTCTTTAGT TGTGGACTGG 3749 
TTCAAGTTTT TCTGCCAATG ATTTCTTCAA ATTTATCAAA TATTTTTCCA TCATGAAGTA 3809 
AAATGCCCTT GCAGTCACCC TTCCTGAAGT TTGAACGACT CTGCTGTTTT AAACAGTTTA 3869 
ACCAAATGGT ATATCATCTT CCGTTTACTA TGTAGCTTAA CTGCAGGCTT ACGCTTTTGA 3929 
GTCAGCGGCC AACTTTATTG CCACCTTCAA AAGTTTATTA TAATGTTGTA AATTTTTACT 3989 
TCTCAAGGTT AGCATACTTA GGAGTTGCTT CACAATTAGG ATTCAGGAAA GAAAGAACTT 4049 
CAGTAGGAAC TGATTGGAAT TTAATGATGC AGCATTCAAT GGGTACTAAT TTCAAAGAAT 4109 
GATATTACAG CAGACACACA GCAGTTATCT TGATTTTCTA GGAATAATTG TATGAAGAAT 4169 
ATGGCTGACA ACACGGCCTT ACTGCCACTC AGCGGAGGCT GGACTAATGA ACACCCTACC 4229 
CTTCTTTCCT TTCCTCTCAC ATTTCATGAG CGTTTTGTAG GTAACGAGAA AATTGACTTG 4289 
CATTTGCATT ACAAGGAGGA GAAACTGGCA AAGGGGATGA TGGTGGAAGT TTTGTTCTGT 4349 
CTAATGAAGT GAAAAATGAA AATGCTAGAG TTTTGTGCAA CATAATAGTA GCAGTAAAAA 4409 
CCAAGTGAAA AGTCTTTCCA AAACTGTGTT AAGAGGGCAT CTGCTGGGAA ACGATTTGAG 4469 
GAGAAGGTAC TAAATTGCTT GGTATTTTCC GTAG GA ACC CCA GAG CGA AAT ACA 4523 
                                     Gly Thr Pro Glu Arg Asn Thr 
                                             115 

GTT TGC AAA AGA TGT CCA GAT GGG TTC TTC TCA AAT GAG ACG TCA TCT 4571 
Val Cys Lys Arg Cys Pro Asp Gly Phe Phe Ser Asn Glu Thr Ser Ser 
120 125 130 135 

AAA GCA CCC TGT AGA AAA CAC ACA AAT TGC AGT GTC TTT GGT CTC CTG 4619 
Lys Ala Pro Cys Arg Lys His Thr Asn Cys Ser Val Phe Gly Leu Leu 
140 145 150 

CTA ACT CAG AAA GGA AAT GCA ACA CAC GAC AAC ATA TGT TCC GGA AAC 4667 
Leu Thr Gln Lys Gly Asn Ala Thr His Asp Asn Ile Cys Ser Gly Asn 
155 160 165 

AGT GAA TCA ACT CAA AAA TGT GGA ATA G GTAATTACAT TCCAAAATAC 4715 
Ser Glu Ser Thr Gln Lys Cys Gly Ile 
170 175 


GTCTTTGTAC GATTTTGTAG TATCATCTCT CTCTCTGAGT TGAACACAAG GCCTCCAGCC 4775 
ACATTCTTGG TCAAACTTAC ATTTTCCCTT TCTTGAATCT TAACCAGCTA AGGCTACTCT 4835 
CGATGCATTA CTGCTAAAGC TACCACTCAC AATCTCTCAA AAACTCATCT TCTCACAGAT 4895 
AACACCTCAA AGCTTCATTT TCTCTCCTTT CACACTGAAA TCAAATCTTG CCCATAGGCA 4955 
AAGGGCAGTG TCAAGTTTGC CACTGAGATG AAATTAGGAG AGTCCAAACT GTAGAATTCA 5015 
CGTTGTGTGT TATTACTTTC ACGAATGTCT GTATTATTAA CTAAAGTATA TATTGGCAAC 5075 
TAAGAAGCAA AGTGATATAA ACATGATGAC AAATTAGGCC AGGCATGGTG GCTTACTCCT 5135 
ATAATCCCAA CATTTTGGGG GGCCAAGGTA GGCAGATCAC TTGAGGTCAG GATTTCAAGA 5195 
CCAGCCTGAC CAACATGGTG AAACCTTGTC TCTACTAAAA ATACAAAAAT TACCTGGGCA 5255 
TGGTAGCAGG CACTTCTAGT ACCAGCTACT CAGGGCTGAG GCAGGAGAAT CGCTTGAACC 5315 
CAGGAGATGG AGGTTGCAGT GAGCTGAGAT TGTACCACTG CACTCCAGTC TGGGCAACAG 5375 
AGCAAGATTT CATCACACAC ACACACACAC ACACACACAC ACACATTAGA AATGTGTACT 5435 
TGGCTTTGTT ACCTATGGTA TTAGTGCATC TATTGCATGG AACTTCCAAG CTACTCTGGT 5495 
TGTGTTAAGC TCTTCATTGG GTACAGGTCA CTAGTATTAA GTTCAGGTTA TTCGGATGCA 5555 
TTCCACGGTA GTGATGACAA TTCATCAGGC TAGTGTGTGT GTTCACCTTG TCACTCCCAC 5615 
CACTAGACTA ATCTCAGACC TTCACTCAAA GACACATTAC ACTAAAGATG ATTTGCTTTT 5675 
TTGTGTTTAA TCAAGCAATG GTATAAACCA GCTTGACTCT CCCCAAACAG TTTTTCGTAC 5735 
TACAAAGAAG TTTATGAAGC AGAGAAATGT GAATTGATAT ATATATGAGA TTCTAACCCA 5795 

GTTCCAGCAT TGTTTCATTG TGTAATTGAA ATCATAGACA AGCCATTTTA GCCTTTGCTT 5855 
TCTTATCTAA AAAAAAAAAA AAAAAAATGA AGGAAGGGGT ATTAAAAGGA GTGATCAAAT 5915 
TTTAACATTC TCTTTAATTA ATTCATTTTT AATTTTACTT TTTTTCATTT ATTGTGCACT 5975 
TACTATGTGG TACTGTGCTA TAGAGGCTTT AACATTTATA AAAACACTGT GAAAGTTGCT 6035 
TCAGATGAAT ATAGGTAGTA GAACGGCAGA ACTAGTATTC AAAGCCAGGT CTGATGAATC 6095 
CAAAAACAAA CACCCATTAC TCCCATTTTC TGGGACATAC TTACTCTACC CAGATGCTCT 6155 
GGGCTTTGTA ATGCCTATGT AAATAACATA GTTTTATGTT TGGTTATTTT CCTATGTAAT 6215 
GTCTACTTAT ATATCTGTAT CTATCTCTTG CTTTGTTTCC AAAGGTAAAC TATGTGTCTA 6275 
AATGTGGGCA AAAAATAACA CACTATTCCA AATTACTGTT CAAATTCCTT TAAGTCAGTG 6335 
ATAATTATTT GTTTTGACAT TAATCATGAA GTTCCCTGTG GGTACTAGGT AAACCTTTAA 6395 
TAGAATGTTA ATGTTTGTAT TCATTATAAG AATTTTTGGC TGTTACTTAT TTACAACAAT 6455 
ATTTCACTCT AATTAGACAT TTACTAAACT TTCTCTTGAA AACAATGCCC AAAAAAGAAC 6515 
ATTAGAAGAC ACGTAAGCTC AGTTGGTCTC TGCCACTAAG ACCAGCCAAC AGAAGCTTGA 6575 
TTTTATTCAA ACTTTGCATT TTAGCATATT TTATCTTGGA AAATTCAATT GTGTTGGTTT 6635 
TTTGTTTTTG TTTGTATTGA ATAGACTCTC AGAAATCCAA TTGTTGAGTA AATCTTCTGG 6695 
GTTTTCTAAC CTTTCTTTAG AT GTT ACC CTG TGT GAG GAG GCA TTC TTC AGG 6747 
                     Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
                                 180                 185 


TTT GCT GTT CCT ACA AAG TTT ACG CCT AAC TGG CTT AGT GTC TTG GTA 6795 
Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val 
190 195 200 

GAC AAT TTG CCT GGC ACC AAA GTA AAC GCA GAG AGT GTA GAG AGG ATA 6843 
Asp Asn Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg Ile 
205 210 215 

AAA CGG CAA CAC AGC TCA CAA GAA CAG ACT TTC CAG CTG CTG AAG TTA 6891 
Lys Arg Gln His Ser Ser Gln Glu Gln Thr Phe Gln Leu Leu Lys Leu 
220 225 230 235 

TGG AAA CAT CAA AAC AAA GAC CAA GAT ATA GTC AAG AAG ATC ATC CAA G 6940 
Trp Lys His Gln Asn Lys Asp Gln Asp Ile Val Lys Lys Ile Ile Gln 
240 245 250 


GTATGATAAT CTAAAATAAA AAGATCAATC AGAAATCAAA GACACCTATT TATCATAAAC 7000 
CAGGAACAAG ACTGCATGTA TGTTTAGTTG TGTGGATCTT GTTTCCCTGT TGGAATCATT 7060 
GTTGGACTGA AAAAGTTTCC ACCTGATAAT GTAGATGTGA TTCCACAAAC AGTTATACAA 7120 
GGTTTTGTTC TCACCCCTGC TCCCCAGTTT CCTTGTAAAG TATGTTGAAC ACTCTAAGAG 7180 
AAGAGAAATG CATTTGAAGG CAGGGCTGTA TCTCAGGGAG TCGCTTCCAG ATCCCTTAAC 7240 
GCTTCTGTAA GCAGCCCCTC TAGACCACCA AGGAGAAGCT CTATAACCAC TTTGTATCTT 7300 
ACATTGCACC TCTACCAAGA AGCTCTGTTG TATTTACTTG GTAATTCTCT CCAGGTAGGC 7360 
TTTTCGTAGC TTACAAATAT GTTCTTATTA ATCCTCATGA TATGGCCTGC ATTAAAATTA 7420 
TTTTAATGGC ATATGTTATG AGAATTAATG AGATAAAATC TGAAAAGTGT TTGAGCCTCT 7480 
TGTAGGAAAA AGCTAGTTAC AGCAAAATGT TCTCACATCT TATAAGTTTA TATAAAGATT 7540 
CTCCTTTAGA AATGGTGTGA GAGAGAAACA GAGAGAGATA GGGAGAGAAG TGTGAAAGAA 7600 
TCTGAAGAAA AGGAGTTTCA TCCAGTGTGG ACTGTAAGCT TTACGACACA TGATGGAAAG 7660 
AGTTCTGACT TCAGTAAGCA TTGGGAGGAC ATGCTAGAAG AAAAAGGAAG AAGAGTTTCC 7720 
ATAATGCAGA CAGGGTCAGT GAGAAATTCA TTCAGGTCCT CACCAGTAGT TAAATGACTG 7780 
TATAGTCTTG CACTACCCTA AAAAACTTCA AGTATCTGAA ACCGGGGCAA CAGATTTTAG 7840 
GAGACCAACG TCTTTGAGAG CTGATTGCTT TTGCTTATGC AAAGAGTAAA CTTTTATGTT 7900 
TTGAGCAAAC CAAAAGTATT CTTTGAACGT ATAATTAGCC CTGAAGCCGA AAGAAAAGAG 7960 
AAAATCAGAG ACCGTTAGAA TTGGAAGCAA CCAAATTCCC TATTTTATAA ATGAGGACAT 8020 
TTTAACCCAG AAAGATGAAC CGATTTGGCT TAGGGCTCAC AGATACTAAG TGACTCATGT 8080 
CATTAATAGA AATGTTAGTT CCTCCCTCTT AGGTTTGTAC CCTAGCTTAT TACTGAAATA 8140 
TTCTCTAGGC TGTGTGTCTC CTTTAGTTCC TCGACCTCAT GTCTTTGAGT TTTCAGATAT 8200 
CCTCCTCATG GAGGTAGTCC TCTGGTGCTA TGTGTATTCT TTAAAGGCTA GTTACGGCAA 8260 
TTAACTTATC AACTAGCGCC TACTAATGAA ACTTTGTATT ACAAAGTAGC TAACTTGAAT 8320 
ACTTTCCTTT TTTTCTGAAA TGTTATGGTG GTAATTTCTC AAACTTTTTC TTAGAAAACT 8380 
GAGAGTGATG TGTCTTATTT TCTACTGTTA ATTTTCAAAA TTAGGAGCTT CTTCCAAAGT 8440 
TTTGTTGGAT GCCAAAAATA TATAGCATAT TATCTTATTA TAACAAAAAA TATTTATCTC 8500 
AGTTCTTAGA AATAAATGGT GTCACTTAAC TCCCTCTCAA AAGAAAAGGT TATCATTGAA 8560 
ATATAATTAT GAAATTCTGC AAGAACCTTT TGCCTCACGC TTGTTTTATG ATGGCATTGG 8620 
ATGAATATAA ATGATGTGAA CACTTATCTG GGCTTTTGCT TTATGCAG AT ATT GAC 8676 
                                                    Asp Ile Asp 

CTC TGT GAA AAC AGC GTG CAG CGG CAC ATT GGA CAT GCT AAC CTC ACC 8724 
Leu Cys Glu Asn Ser Val Gln Arg His Ile Gly His Ala Asn Leu Thr 
255 260 265 270 

TTC GAG CAG CTT CGT AGC TTG ATG GAA AGC TTA CCG GGA AAG AAA GTG 8772 
Phe Glu Gln Leu Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val 
275 280 285 

GGA GCA GAA GAC ATT GAA AAA ACA ATA AAG GCA TGC AAA CCC AGT GAC 8820 
Gly Ala Glu Asp Ile Glu Lys Thr Ile Lys Ala Cys Lys Pro Ser Asp 
290 295 300 

CAG ATC CTG AAG CTG CTC AGT TTG TGG CGA ATA AAA AAT GGC GAC CAA 8868 
Gln Ile Leu Lys Leu Leu Ser Leu Trp Arg Ile Lys Asn Gly Asp Gln 
305 310 315 

GAC ACC TTG AAG GGC CTA ATG CAC GCA CTA AAG CAC TCA AAG ACG TAC 8916 
Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr 
320 325 330 

CAC TTT CCC AAA ACT GTC ACT CAG AGT CTA AAG AAG ACC ATC AGG TTC 8964 
His Phe Pro Lys Thr Val Thr Gln Ser Leu Lys Lys Thr Ile Arg Phe 
335 340 345 350 

CTT CAC AGC TTC ACA ATG TAC AAA TTG TAT CAG AAG TTA TTT TTA GAA 9012 
Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gln Lys Leu Phe Leu Glu 
355 360 365 

ATG ATA GGT AAC CAG GTC CAA TCA GTA AAA ATA AGC TGC TTA 9054 
Met Ile Gly Asn Gln Val Gln Ser Val Lys Ile Ser Cys Leu 
370 375 380 

TAACTGGAAA TGGCCATTGA GCTGTTTCCT CACAATTGGC GAGATCCCAT GGATGAGTAA 9114 
ACTGTTTCTC AGGCACTTGA GGCTTTCAGT GATATCTTTC TCATTACCAG TGACTAATTT 9174 
TGCCACAGGG TACTAAAAGA AACTATGATG TGGAGAAAGG ACTAACATCT CCTCCAATAA 9234 
ACCCCAAATG GTTAATCCAA CTGTCAGATC TGGATCGTTA TCTACTGACT ATATTTTCCC 9294 
TTATTACTGC TTGCAGTAAT TCAACTGGAA ATTAAAAAAA AAAAACTAGA CTCCACTGGG 9354 
CCTTACTAAA TATGGGAATG TCTAACTTAA ATAGCTTTGG GATTCCAGCT ATGCTAGAGG 9414 
CTTTTATTAG AAAGCCATAT TTTTTTCTGT AAAAGTTACT AATATATCTC TAACACTATT 9474 
ACAGTATTGC TATTTATATT CATTCAGATA TAAGATTTGG ACATATTATC ATCCTATAAA 9534 
GAAACGGTAT GACTTAATTT TAGAAAGAAA ATTATATTCT GTTTATTATG ACAAATGAAA 9594 
GAGAAAATAT ATATTTTTAA TGGAAAGTTT GTAGCATTTT TCTAATAGGT ACTGCCATAT 9654 
TTTTCTGTGT GGAGTATTTT TATAATTTTA TCTGTATAAG CTGTAATATC ATTTTATAGA 9714 
AAATGCATTA TTTAGTCAAT TGTTTAATGT TGGAAAACAT ATGAAATATA AATPATCTGA 9774 
ATATTAGATG CTCTGAGAAA TTGAATGTAC CTTATTTAAA AGATTTTATG GTTTTATAAC 9834 
TATATAAATG ACATTATTAA AGTTTTCAAA TTATTTTTTA TTGCTTTCTC TGTTGCTTTT 9894 
jftn 9898 

Sequence number: 3 
Length of sequence: 401 
Sequence Type: amino acid 
Strandedness : single stranded 
Topology: linear 
Molecular type: protein 


Sequence : 

Met Asn Asn Leu Leu Cys Cys Ala Leu Val Phe Leu Asp Ile Ser 
    -20                 -15                 -10 

Ile Lys Trp Thr Thr Gln Glu Thr Phe Pro Pro Lys Tyr Leu His 
    -5                  1               5 

Tyr Asp Glu Glu Thr Ser His Gln Leu Leu Cys Asp Lys Cys Pro 
10                  15                  20 

Pro Gly Thr Tyr Leu Lys Gln His Cys Thr Ala Lys Trp Lys Thr 
25                  30                  35 

Val Cys Ala Pro Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp His 
40                  45                  50 

Ile Gln Asp Ile Asp Leu Cys Glu Asn Ser Val Gln Arg His Ile 
250                 255                 260 

Gly His Ala Asn Leu Thr Phe Glu Gln Leu Arg Ser Leu Met Glu 
265                 270                 275 

Ser Leu Pro Gly Lys Lys Val Gly Ala Glu Asp Ile Glu Lys Thr 
280                 285                 290 

Ile Lys Ala Cys Lys Pro Ser Asp Gln Ile Leu Lys Leu Leu Ser 
295                 300                 305 

Leu Trp Arg Ile Lys Asn Gly Asp Gln Asp Thr Leu Lys Gly Leu 
310                 315                 320 

Met His Ala Leu Lys His Ser Lys Thr Tyr His Phe Pro Lys Thr 
325                 330                 335 

Val Thr Gln Ser Leu Lys Lys Thr Ile Arg Phe Leu His Ser Phe 
340                 345                 350 

Thr Met Tyr Lys Leu Tyr Gln Lys Leu Phe Leu Glu Met Ile Gly 
355                 360                 365 

Asn Gln Val Gln Ser Val Lys Ile Ser Cys Leu 
370                 375                 380 

Sequence number: 4 
Length of sequence: 1206 
Sequence Type : nucleic acid 
Strandedness : single stranded 
Topology: linear 
Molecular type: cDNA 
Sequence: 

ATGAACAACT TGCTCTCCTC CGCGCTCGTG TTTCTGGACA TCTCCATTAA GTGGACCACC 60 
CAGGAAACGT TTCCTCCAAA GTACCTTCAT TATGACGAAG AAACCTCTCA TCAGCTGTTG 120 
TGTGACAAAT GTCCTCCTGG TACCTACCTA AAACAACACT GTACAGCAAA GTGGAAGACC 180 
GTGTGCGCCC CTTGCCCTGA CCACTACTAC ACAGACAGCT GGCACACCAG TGACGAGTGT 240 
CTATACTGCA GCCCCGTGTG CAAGGAGCTG CAGTACGTCA AGCAGGAGTG CAATCGCACC 300 
CACAACCGCG TGTGCGAATG CAAGGAAGGG CGCTACCTTG AGATAGAGTT CTGCTTGAAA 360 
CATAGGAGCT GCCCTCCTGG ATTTGGAGTG GTGCAAGCTG GAACCCCAGA GCGAAATACA 420 
GTTTGCAAAA GATCTCCAGA TGGGTTCTTC TCAAATGAGA CGTCATCTAA AGCACCCTGT 480 
AGAAAACACA CAAATTGCAG TGTCTTTGGT CTCCTGCTAA CTCAGAAAGG AAATGCAACA 540 
CACGACAACA TATGTTCCGG AAACAGTGAA TCAACTCAAA AATGTGGAAT AGATGTTACC 600 
CTGTGTGAGG AGGCATTCTT CAGGTTTGCT GTTCCTACAA AGTTTACGCC TAACTGGCTT 660 
AGTGTCTTGG TAGACAATTT GCCTGGCACC AAAGTAAACG CAGAGAGTGT AGAGAGGATA 720 
AAACGGCAAC ACAGCTCACA AGAACAGACT TTCCAGCTGC TGAAGTTATG GAAACATCAA 780 
AACAAAGACC AAGATATAGT CAAGAAGATC ATCCAAGATA TTGACCTCTG TGAAAACAGC 840 
GTGCAGCGGC ACATTGGACA TGCTAACCTC ACCTTCGAGC AGCTTCGTAG CTTGATGGAA 900 
AGCTTACCGG GAAAGAAAGT GGGAGCAGAA GACATTGAAA AAACAATAAA GGCATGCAAA 960 
CCCAGTGACC AGATCCTGAA GCTGCTCAGT TTGTGGCGAA TAAAAAATGG CGACCAAGAC 1020 
ACCTTGAAGG GCCTAATGCA CGCACTAAAG CACTCAAAGA CGTACCACTT TCCCAAAACT 1080 
GTCACTCAGA GTCTAAAGAA GACCATCAGG TTCCTTCACA GCTTCACAAT GTACAAATTG 1140 
TATCAGAAGT TATTTTTAGA AATGATAGGT AACCAGGTCC AATCAGTAAA AATAAGCTGC 1200 
TTATAA 1206 
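
The stated lengths of Sequence No. 4 (1206 nucleotides) and Sequence No. 3 (401 amino acids) are consistent with each other: 1206 / 3 = 402 codons, that is, 401 residues plus one stop codon. The following minimal sketch, which is not part of the original disclosure, shows that arithmetic together with a small translation helper based on the standard genetic code. The function names are hypothetical, and the demonstration translates only a short stretch so that nothing depends on individual bases of the printed listing.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) into one-letter amino acids; '*' = stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    codons = (dna[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def coding_capacity(length_nt: int) -> int:
    """Residues encoded by an open reading frame that ends in one stop codon."""
    if length_nt % 3:
        raise ValueError("length is not a multiple of three")
    return length_nt // 3 - 1


if __name__ == "__main__":
    print(coding_capacity(1206))          # residues encoded by a 1206-nt frame
    print(translate("ATGAACAAC TTATAA"))  # first three and last two codons
```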




CLAIMS: 

1. A DNA comprising the nucleotide sequences of the 
Sequences No. 1 and No. 2 in the Sequence Table. 

2. The DNA according to claim 1, wherein the Sequence 
ID No. 1 includes the first exon of the OCIF gene and the Sequence 
ID No. 2 includes the second, third, fourth, and fifth exons.

3. A protein exhibiting the activity of inhibiting
differentiation and/or maturation of osteoclasts and having the 
following physicochemical characteristics, 

(a) molecular weight (SDS-PAGE):

(i) Under reducing conditions: about 60 kD, 

(ii) Under non-reducing conditions: about 60 kD and about 
120 kD; 

(b) amino acid sequence: 

includes an amino acid sequence of the Sequence ID No. 
3 in the Sequence Table, 

(c) affinity: 

exhibits affinity to a cation exchanger and heparin, and 

(d) heat stability: 

(i) the osteoclastogenesis-inhibitory activity is 
reduced when treated with heat at 70°C for 10 minutes 
or at 56°C for 30 minutes,

(ii) the osteoclastogenesis-inhibitory activity is lost 
when treated with heat at 90°C for 10 minutes. 

4. A process for producing a protein exhibiting an
activity of inhibiting differentiation and/or maturation of
osteoclasts and having the following physicochemical 
characteristics, 

(a) molecular weight (SDS-PAGE):

(i) Under reducing conditions: about 60 kD, 

(ii) Under non-reducing conditions: about 60 kD and about 
120 kD; 

(b) amino acid sequence: 

includes an amino acid sequence of the Sequence ID No. 
3 of the Sequence Table, 

(c) affinity: 

exhibits affinity to a cation exchanger and heparin, and 

(d) heat stability: 

(i) the osteoclastogenesis-inhibitory activity is 
reduced when treated with heat at 70°C for 10 minutes 
or at 56°C for 30 minutes,

(ii) the osteoclastogenesis-inhibitory activity is lost 
when treated with heat at 90°C for 10 minutes, 

the process comprising inserting a DNA including the nucleotide 
sequences of the Sequences No. 1 and No. 2 in the Sequence Table
into an expression vector, producing a vector capable of 
expressing a protein having the above-mentioned 
physicochemical characteristics and exhibiting the activity of 
inhibiting differentiation and/or maturation of osteoclasts, 
and producing this protein by a genetic engineering technique.

ABSTRACT OF THE DISCLOSURE
DNAs having the nucleotide sequences of the Sequences No.
1 and No. 2 in the Sequence Table, and a process for producing
a protein which comprises inserting these DNAs into expression
vectors to thereby produce a protein having molecular weights
of about 60 kD (under reducing conditions) and about 60 kD and
120 kD (under non-reducing conditions) and being capable of
inhibiting the formation of osteoclasts. These proteins are useful
in the treatment of osteoporosis and rheumatism.